Global–Local Self-Attention Based Transformer for Speaker Verification

نویسندگان

چکیده

Transformer models are now widely used for speech processing tasks due to their powerful sequence modeling capabilities. Previous work determined an efficient way model speaker embeddings using the by combining transformers with convolutional networks. However, traditional global self-attention mechanisms lack ability capture local information. To alleviate these problems, we proposed a novel global–local mechanism. Instead of or multi-head attention alone, this method performs and in parallel two groups enhance reduce computational cost. better handle location information, introduced locally enhanced encoding verification task. The experimental results VoxCeleb1 test set VoxCeleb2 dev demonstrated improved effect our Compared Transformer-based Robust Embedding Extractor Baseline System, network exhibited performance

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Attention-Based Models for Text-Dependent Speaker Verification

Attention-based models have recently shown great performance on a range of tasks, such as speech recognition, machine translation, and image captioning due to their ability to summarize relevant information that expands through the entire length of an input sequence. In this paper, we analyze the usage of attention mechanisms to the problem of sequence summarization in our end-to-end textdepend...

متن کامل

Information based Speaker Verification

We discuss in this paper the conceptual and compu­ tational framework.of informatzon theory for decision making in speaker verification. The proposed approach departs itself from other conventional scoring models for speaker verification as the first approach takes into account the quantity of 'surprise' or information con­ tent. We compare the new approach with a widely used log-likelihood nor...

متن کامل

Best speaker-based structure tree for speaker verification

In this paper we study the use of the Wavelet transform for textdependent speaker verification purposes. A new algorithm to construct the best admissible tree is proposed which has been used to obtain a speaker dependent tree library. Every tree in this library corresponds to the best structure for a given speaker, therefore the extracted parameters from a given tree are well suited and discrim...

متن کامل

Tree based score computation for speaker verification

This paper proposes an original approach to the task of speaker verification, in which the training process consists in a direct modeling of the score function. It divides the parameter space in disjoint regions where a score can be obtained as a simple function of the vector position in the region. The aim of this approach is, on the one hand to overcome some undesirable properties of the gaus...

متن کامل

Autoassociator-based models for speaker verification

In this paper, we propose an autoassociator-based connectionist model that turns out to be very useful for problems of pattern veriication. The model is based on feedforward networks acting as autoassociators trained to reproduce patterns presented at the input to the output layer. The veriication is established on the basis of the distance between the input and the output vectors. We give expe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2022

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app121910154